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Sequence Listing 



(1) GENERAL INFORMATION: 

(i) APPLICANT: : . 

(A) NAME: Max-Planck-Gesellschaf t zur Forderung der 

Wissenschaften" e,V, 

(B) STREET: : none 

(C) CITY: Berlin 

(E) COUNTRY: Germany 

(F) ZIP CODE: none 

(ii) TITLE OF THE INVENTION: Nucleic acid molecules encoding proteins 
which impart the adhesion of Neisseria cells to human cells 

(iii) NUMBER OF SEQUENCES: 4 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentin Release #1.0, Version #1.30 (EPA) 

(vi) DATA OF PRIOR APPLICATION: 

(A) APPLICATION NUMBER: DE 19534579.7 

(B) FILING DATE: 18-SEP-1995 



(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3287 base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(vi) IMMEDIATE SOURCE: 

(A) ORGANISM: Neisseria "gongrrheae 

(B) STRAIN: MSll 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: genomic library in pBA 

(B) CLONE (S) : H1967/pES25 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 136. .4 47 
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(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 583. . 1542 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1585. .3111 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CGGCGCAAAC GGCGGACGCT GCTGTTAGCC CCGCTTGAAA CAAATGCCGT CTGAACGCCA 

CTTCAGACGG CATTTTTATA ATAAGGCGCT GTCCTAGATA ACTAGGGAAA TTCAAATTAA 

GTTAGAATTA TCCCT ATG AGA AAA AGO CGT CTA AGC CGG TAT AAA CAA AAT 
Met Arg Lys Ser Arg Leu Ser Arg Tyr Lys Gin Asn 
1 5 10 ' 

AAA CTC ATT GAA CTG TTT GTC GCA GGC GTA ACT GCA AGA ACA GCA GCA 
Lys Leu lie Glu Leu Phe Val Ala Giy Val Thr Ala Arg Thr Ala Ala 

15 ' ■ 20 25 . 

GAG CCT GAC AGC ATT GTT TAT ACG GAT TGT TAT CGT CGC TAT GAT GTA 
Glu Pro Asp Ser lie Val Tyr Thr Asp Cys Tyr Arg Arg Tyr Asp Val 
30 35 40 

TTG GAT GCG GGC GAA TTT AGC CAT TTC CGT ATC AAT CAC AGC ACA CAT 
Leu Asp Ala Gly Glu Phe Ser His Phe Arg lie Asn His Ser Thr His 
45 50 55 60 

TTT GCC GAA CGA CAA AAC CAT ATT AAT GGA ATT GGG AAC TTT TGG AAC 
Phe Ala Glu Arg Gin Asn His lie Asn Gly lie Gly Asn Phe Trp Asn 
65 . 70 75 

CGG GCA AAA CGT CAT TTA CGC AAG TTT GAC GGC ATT CCC AAA GAG CAT 
Arg Ala Lys Arg His Leu Arg Lys Phe Asp Gly lie Pro Lys Glu His 
80 - .85 90 . 

TTT GAG CCG TAT TTA AAG GAG TGC GAA CGG CGT TTT TAACAACAGT 
Phe Glu Pro Tyr Leu Lys Glu Cys Glu Arg Arg Phe 
95 100 

GAGATAAAAG TTCTTGTTCC ATTTTAAAAC AATTAGTAAA ATCGAGTTTA TCCTAGTTGT 

CCAGGACGGC CCCTAATTTA TTTACAATTT TGATACAATT TGTTTTTCAT CAAAGGAGAA 

AATCT ATG CGG GCA CGG CTG CTG ATA CCT ATT CTT TTT TCG GTT TTT 
Met Arg Ala Arg Leu Leu lie Pro lie Leu Phe Ser Val Phe 
1 5 10 

ATT TTA TCC GCC TGC GGG ACA CTG ACA GGT ATT CCA TCG CAT GGC GGA 
lie Leu Ser Ala Cys Gly Thr Leu Thr Gly lie Pro Ser His Gly Gly 
15. 20 25 30 
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GGC AAA CGC TTC GCG GTC GAA CAA GAA CTT GTG GCC GCT TCT GCC AGA 720 
Gly Lys Arg Phe Ala Val Glu Gin Glu Leu Val Ala Ala Ser Ala Arg 
- 35 40 45 

GCT GCC GTT AAA GAC ATG GAT TTA CAG GCA TTA CAC GGA CGA AAA GTT 768 
Ala Ala Val Lys Asp Met Asp Leu Gin Ala Leu His Gly Arg Lys Val 
50 55 ,60 

GCA TTG TAC ATT GCA ACT ATG GGC GAC CAA GGT TCA GGC AGT TTG ACA 816 
Ala Leu Tyr lie Ala Thr Met Gly Asp Gin Gly Ser Gly Ser Leu Thr 
65 70 75 

GGG GGT CGC TAC TCC ATT GAT GCA CTG ATT CGC GGC GAA TAC ATA AAC 864 
Gly Gly Arg Tyr Ser lie Asp Ala Leu lie Arg Gly Glu Tyr lie Asn 

80 85 90 • ■ 

AGC CCT GCC GTC CGC ACC GAT TAC ACC TAT CCG CGT TAC GAA ACC ACC 912 
Ser Pro Ala Val Arg Thr Asp Tyr Thr Tyr Pro Arg Tyr Glu Thr Thr 
95 100 . • 105 110 

GCT GAA ACA ACA TCA GGC GGT TTG ACG GGT TTA ACC ACT TCT TTA TCT 960 
Ala Glu Thr Thr Ser Gly Gly Leu Thr Gly Leu Thr Thf Ser Leu Ser 
115 120 125 

ACA CTT AAT GCC CCT GCA CTC TCG CGC ACC CAA TCA GAC GGT AGC GGA .1008 
Thr Leu Asn Ala Pro Ala Leu Ser Arg Thr Gin Ser Asp Gly Ser Gly 
130 135 140 

AGT AGG AGC AGT CTG GGC TTA AAT ATT GGC GGG ATG GGG GAT TAT CGA 1056 
Ser Arg. Ser Ser Leu Gly Leu Asn lie Gly Gly' Met Gly Asp Tyr Arg 
145 150 155 

AAT GAA ACC TTG ACG ACC AAC CCG CGC GAC ACT GCC TTT CTT TCC CAC 1104 
Asn Glu Thr Leu Thr Thr Asn Pro Arg Asp Thr Ala Phe Leu Ser His 
160 165 .170 

TTG GTA CAG ACC GTA TTT TTC CTG CGC GGC ATA GAC GTT GTT TCT CCT 1152 
Leu Val Gin Thr Val Phe Phe Leu Arg Gly lie Asp Val Val Ser Pro 
175 180 185 190 

GCC AAT GCC GAT ACA GAT GTG TTT ATT AAC ATC GAC GTA TTC GGA ACG 1200 
Ala Asn Ala Asp Thr Asp Val Phe lie Asn lie Asp Val Phe Gly Thr 
195 200 205 ' 

ATA CGC AAC AGA ACC GAA ATG CAC CTA TAC AAT GCC GAA ACA CTG AAA 124 8 

lie Arg Asn Arg Thr Glu Met His Leu Tyr Asn Ala Glu Thr Leu Lys 
210 215 220 

GCC CAA ACA AAA CTG GAA TAT TTC GCA GTA GAC AGA ACC AAT AAA AAA 1296 
Ala Gin Thr Lys Leu Glu Tyr Phe Ala Val Asp Arg Thr Asn Lys Lys 
225 230 235 

TTG CTC ATC AAA CCC AAA ACC AAT GCG TTT GAA GCT GCC TAT AAA GAA 134 4 

Leu Leu lie Lys Pro Lys Thr Asn Ala Phe Glu Ala Ala Tyr Lys Glu 
240 .245 . 250 
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AAT TAG GCA TTG TGG ATG GGG CCG TAT AAA GTA AGC AAA GGA ATC AAA 
Asn Tyr Ala Leu Trp Met Giy Pro Tyr Lys Val Ser Lys Gly lie Lys 
255 260 265 270 

CCG ACG GAA GGA TTA ATG GTC GAT TTC TCC GAT ATC CGG CCA TAG GGC 
Pro Thr Glu Giy Leu Met Val Asp Phe Ser Asp lie Arg Pro Tyr Gly 
275 280 285 

AAT CAT ACG GGT AAC TCC GCC CCA TCC GTA GAG GCT GAT AAC AGT CAT 
Asn His Thr Gly Asn Ser Ala Pro Ser Val Glu Ala Asp Asn Ser His 
290 295 300 

GAG GGG TAT GGA TAG AGC GAT GAA GCA GTG CGA CAA CAT AGA CAA GGG 
Glu Gly Tyr Gly Tyr Ser Asp Glu Ala Val Arg Gin His Arg Gin Gly 
305 310 315 

CAA CCT TGATTCACAC TGCCATAACC GCTTGCTGCC AAGGAAAACA AA ATG AAT 
Gin Pro Met Asn 

320 ' 1 

TTG CCT ATT CAA AAA TTC ATG ATG CTG TTT GCA GCG GCA ATA . TCG TTG 
Leu Pro lie Gin Lys Phe Met Met Leu Phe Ala Ala Ala lie Ser Leu 
5 10 15 

CTG CAA ATC CCC ATT AGT CAT GCG AAC GGT TTG GAT GCC CGT TTG CGC 
Leu Gin lie Pro lie Ser His Ala Asn Gly Leu Asp Ala Arg Leu Arg 
20 25 30 

GAT GAT ATG CAG GCA Ai\A CAC TAG GAA CCG GGT GGC AAA TAG CAT CTG 
Asp Asp Met Gin Ala Lys His Tyr Glu Pro Gly Gly Lys Tyr His Leu 
35 40 45 50 

TTC GGT AAT GCT CGC GGC AGT GTT AAA AAT CGG GTT TGC GCC GTC CAA 
Phe Gly Asn Ala Arg Gly Ser Val Lys Asn Arg Val Cys Ala Val Gin 
55 . 60 65 

ACA TTT GAT GCA. ACT GCG GTC GGC CCC ATA CTG CCT ATT ACA CAC GAA 
Thr Phe Asp Ala Thr Ala Val Gly Pro lie Leu Pro lie Thr His Glu 
70 75 80 

CGG ACA GGG TTT GAA GGC ATT ATC GGT TAT GAA ACC CAT TTT TCA GGA 
Arg Thr Gly Phe Glu Gly lie lie Gly Tyr Glu Thr His Phe Ser Giy 
85 90 95 

CAC GGA CAC GAA GTA CAC AGT CCG TTC GAT AAT CAT GAT TCA AAA AGC 
His Gly His Glu Val His Ser Pro Phe Asp Asn His Asp Ser Lys Ser 
100 105 110 

ACT TCT GAT TTC AGC GGC GGC GTA GAG GGC GGT TTT ACC GTT TAG CAA 
Thr Ser Asp Phe Ser Gly Gly Val Asp Gly Gly Phe Thr Val Tyr Gin 
115 120 125 130 

CTT CAT CGG ACA GGG TCG GAA ATA CAT CCC GCA GAG GGA TAT GAG GGG 
Leu His Arg Thr Gly Ser Glu lie His Pro Ala Asp Gly Tyr Asp Gly 
135 140 145 
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CCT CAA GGG GGC GGT TAT CCG GAA CCA GAA GGG GCA AGG GAT ATA TAG 
Pro Gin Gly Gly Gly Tyr Pro Glu Pro Gin Gly Ala Arg Asp lie Tyr 
150 155 160 

AGO TAG CAT ATC AAA GGA ACT TCA ACG AAA ACA AAG ATA AAC ACT GTT 
Ser Tyr His lie Lys Gly Thr Ser Thr Lys Thr Lys lie Asn Thr Val 
165 170 175 

CCG CAA GCC CCT TTT TCA GAC CGC TGG CTA AAA GAA AAT GCC GGT GCC 
Pro Gin Ala Pro Phe Ser Asp Arg Trp Leu Lys Glu Asn Ala Gly Ala 
180 185 190 

GCT TCC GGT TTT CTC AGC CGT GCG GAT GAA GCA GGA AAA CTG ATA TGG 
Ala Ser Gly Phe Leu Ser Arg Ala Asp Glu Ala Gly Lys Leu lie Trp 
195 200 205 210 

GAA AAC GAC CCC GAT AAA AAT TGG CGG GCT AAC CGT ATG GAT GAT ATT 
Glu Asn Asp Pro Asp Lys Asn Trp Arg Ala Asn Arg Met Asp Asp lie 
215 220 225 

CGC GGC ATC GTC CAA GGT GCG GTT AAT CCT TTT TTA ACG GGT TTT GAG 
Arg Gly lie Val Gin Gly Ala Val Asn Pro Phe Leu Thr Gly Phe Gin 
230 235 240 

GGA TTG GGA GTT GGG GCA ATT ACA GAC AGT GCG GTA AGC CCG GTA ACC 
Gly Leu Gly Val Gly Ala lie Thr Asp Ser Ala Val Ser Pro Val Thr 
245 250 255 

TAT GCG GCA GCA CGG AAA ACT TTA CAG GGT ATT CAC AAT TTA GGA AAT 
Tyr Ala Ala Ala Arg Lys Thr Leu Gin Gly lie His Asn Leu Gly Asn 
260 265 270 

TTA AGT CCG GAA GCA CAA CTT GCC GCC GCG AGC CTA TTA CAG GAC AGT 
Leu Ser Pro Glu Ala Gin Leu Ala Ala Ala Ser Leu Leu Gin Asp Ser 
275 280 . 285 290 

GCC TTT GCG GTA AAA GAC GGC ATC AAT TCC GCC AGA CAA TGG GCT GAT 
Ala Phe Ala Val Lys Asp Gly He Asn Ser Ala Arg Gin Trp Ala Asp 
295 300 305 

GCC CAT GCG AAT ATA ACA GCA ACA GCC CAA ACT GCC CTT GCC GTA GCA 
Ala His Pro Asn He Thr Ala Thr Ala Gin Thr Ala Leu Ala Val Ala 
310 315 320 

GAG GCT GCA GGT ACG GTT TGG GGA GGT AAA AAA GTA GAA CTT AAC CCG 
Glu Ala Ala Gly Thr Val Trp Gly Gly Lys Lys Val Glu Leu Asn Pro 

325 330 335 

ACC AAA TGG GAT TGG GTT AAA AAT ACC GGC TAT GAA AAA CCT GCT GCC 
Thr Lys Trp Asp Trp Val Lys Asn Thr Gly Tyr Glu Lys Pro Ala Ala 
340 345 350 

CGA CCT ATG CAG ACT GTA GAC GGG GAA ATG GCC GGG AAA AAT AAG CCA 
Arg Pro Met Gin Thr Val Asp Gly Glu Met Ala Gly Lys Asn Lys Pro 
355 360 365 370 
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CCG AAA CCA ACT ACG CAG CAA CAC TCT ACA. CAC TCT GAT AAC AAT ATC 27 42 

Pro Lys Pro Ser Thr Gin Gin His Ser Thr His Ser Asp Asn Asn lie 
375 380 385 

GGC TTA CCT GCC CCA TAT GTT AAA CCT GAT ACA TCT ATT TCT CCG ACA 27 90 

Gly Leu Pro Ala Pro Tyr Val Lys Pro Asp Thr Ser lie Ser Pro Thr 
390 395 400 

GGA ACA ATT CAA GAC CGC ATC AGA TGG ACA AAA TCC AAG TTT CCT ACT 28 38 

Gly Thr lie Gin Asp Arg lie Arg Trp Thr Lys Ser Lys Phe Pro Thr 
405 410 415 

GAG AAA TCT TTA AAT GGA CAT TTC AAA GCT CAT GGA AAA GAA TTT GGC 2886 
Glu Lys Ser Leu Asn Gly His Phe Lys Ala His Gly Lys Glu Phe Gly 
420 425 430 

GAT ATA ACC ATT GAA GAC TAC CAA AAA ATG GCG TCT GAT TTG TTA TCA 2 934 

Asp lie Thr lie Glu Asp Tyr Gin Lys Met Ala Ser Asp Leu Leu Ser 
435 440 445 450 

AAA CAG ACA TCG GAC AAG ATA TTA GQT TAT CAG ACG GAA CAT AGA CGA 2982 
Lys Gin Thr Ser Asp Lys lie Leu Gly Tyr Gin Thr Glu His Arg Arg 
455 460 465 

GTG CGC TAT. GAT ATC AAT AAC AAT ATC TAT GTT TTG GCC AAT CCA AAA 3030 
Val Arg Tyr Asp lie Asn Asn Asn lie Tyr Val Leu Ala- Asn Pro Lys 
470 475 480 

ACA TTC AAA ATC AAA ACA ATG TTT AAA CCA AAC TTA GGA AAG GAG TAT . 3078 
Thr Phe Lys lie Lys Thr Met Phe Lys Pro Asn Leu Gly Lys Glu Tyr 
485 490 495 

TAT GAT GGA GAA TTC AAA AAA GAC ATG GGA AAT TGACGGAGAA ATATGGCTAC 3131 
Tyr Asp Gly Glu Phe Lys Lys Asp Met Gly Asn 
500 505 

ATTGTCCTGT TTGCGGAACT GAAGTTATGG ACTATGATAT CTGTGACGTT TGTCAGTGGC 3191 

AAAATACAGG AGAAACTAAT ATAGATGGTG GTCCTAATGA AATGACACTT GCGGAGGCGA 3251 

AAGAAGCTTA CGCAAAAGGC TTACCAATCA GATAAA 328 7 



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1136 base pairs 

(B) TYPE: Nucleotid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTISENSE: NO 

(vi) IMMEDIATE SOURCE: 

(A) ORGANISM: Neisseria gonorrheae 



31 



,{B) STRAIN: MSll 

(ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 135. ,1094 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
AACAACAGTG AGATAAAAGT TCTTGTTCCA TTTTAAAACA ATTAGTAAAA TCGAGTTTAT 

CCTAGTTGTC CAGGACGGCC CCTAATTTAT TTACAATTTT GATACAATTT GTTTTTCATC 

AAAGGAGAAA ATCT ATG CGG GCA CGG CTG CTG ATA CCT ATT CTT TTT TCG 
Met Arg Ala Arg Leu Leu lie Pro lie Leu Phe Ser 
510 515- 520 

GTT TTT ATT TTA TCC GCC TGC GGG ACA CTG ACA GGT ATT CCA TCG GAT 
Val Phe lie Leu Ser Ala Cys Gly Thr Leu Thr Gly lie Pro Ser His 
525 530 535 

GGC GGA GGC AAA CGG TTC GCG GTG GAA CAA ' GAA CTT GTG GCC GCT TCT 
Gly Gly Gly Lys Arg Phe Ala Val Glu Gin Glu Leu Val Ala Ala Ser 
540 545 550 

GCC AGA GCT GCC GTT AAA GAG ATG GAT TTA CAG GCA TTA CAC GGA CGA 
Ala Arg Ala Ala Val Lys Asp Met Asp Leu Gin Ala Leu His Gly Arg 
555 560 565 

AAA GTT GCA TTG TAG ATT GCA ACT ATG GGC GAG CAA GGT TCA GGC-AGT 
Lys Val Ala Leu Tyr lie Ala Thr Met Gly Asp Gin Gly Ser Gly Ser 
570 575 580 585 

TTG ACA GGG GGT GGC TAG TCC ATT GAT GCA CTG ATT GGC GGC GAA TAG 
Leu Thr Gly Gly Arg Tyr .Ser lie Asp Ala Leu lie Arg Gly Glu Tyr 
590 • 595 600 

ATA AAC AGC CCT GCC GTC GGC ACC GAT TAG ACC TAT GCG CGT TAG GAA 
lie Asn Ser Pro Ala Val Arg thr Asp Tyr Thr Tyr Pro Arg Tyr Glu 
605 610 615 

ACC ACC GCT GAA ACA ACA TCA GGC GGT TTG ACG GGT TTA ACC ACT TCT 
Thr Thr Ala Glu Thr Thr Ser Gly Gly Leu Thr Gly Leu Thr Thr Ser 
620 625 630 



TTA TCT ACA CTT AAT GCC CCT GCA CTG TCG CGC ACC CAA TCA GAC GGT 

Leu Ser Thr Leu Asn Ala Pro Ala Leu Ser Arg Thr Gin Ser Asp Gly 
635 640 645 

AGC GGA AGT AGG AGC AGT CTG GGC TTA AAT ATT GGC GGG ATG GGG GAT 

Ser Gly Ser Arg Ser Ser Leu Gly Leu Asn lie Gly Gly ^!et Gly Asp 

650 655 660^ 665 
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TAT CGA AAT GAA ACC TTG ACQ ACC AAC CCG CGC GAG ACT GCG TTT CTT 650 
Tyr Arg Asn Glu Thr Leu Thr Thr Asn Pro Arg Asp Thr Ala Phe Leu 
670 675 680 

TCC CAC TTG GTA GAG ACC GTA TTT TTC CTG CGC GGC ATA GAC GTT GTT 698 
Ser His Leu Val Gin Thr Val Phe Phe Leu Arg Gly lie Asp Val Val 
* 685 690 695 

TCT CCT GCC AAT GCC GAT ACA GAT GTG TTT ATT AAC ATC GAC GTA TTC 74 6 

Ser Pro Ala Asn Ala Asp Thr Asp Val Phe lie Asn lie Asp Val Phe 
700 705 710 

GGA ACG ATA CGC AAC AGA ACC GAA ATG CAC GTA TAG AAT GCC GAA ACA 794 
Gly Thr lie Arg Asn Arg Thr Glu Met His Leu Tyr Asn Ala Glu Thr 
715 720 725 

CTG AAA GCC CAA ACA AAA CTG GAA TAT TTC GCA GTA GAC AGA ACC AAT 8 42 

Leu Lys Ala Gin Thr Lys Leu Glu Tyr Phe Ala Val Asp Arg Thr Asn 
730 735. 740 745 

AAA AAA TTG CTG ATC AAA CCC AAA ACC AAT GCG TTT GAA GCT GCC TAT 8 90 

Lys Lys Leu Leu lie Lys Pro Lys Thr Asn Ala Phe Glu Ala Ala Tyr 
750 755 760 

AAA GAA AAT TAG GCA TTG TGG ATG GGG CCG TAT AAA GTA AGC AAA GGA 938 
Lys Glu Asn Tyr Ala Leu Trp Met Gly Pro Tyr Lys Val Ser Lys Gly 
765 770 775 

ATC AAA CCG ACG GAA GGA TTA ATG GTC GAT TTC TCC GAT ATG GGG CCA 98 6 

lie Lys Pro Thr Glu Gly Leu Met Val Asp Phe Ser Asp lie Arg Pro 
780 785 790 

TAG GGC AAT CAT ACG GGT AAC TCC GCC CCA TCC GTA GAG GCT GAT AAC 1034 
Tyr Gly Asn His Thr Gly Asn Ser Ala Pro Ser Val Glu Ala Asp Asn 
795 -800 805 

AGT CAT GAG GGG TAT GGA TAG AGC GAT GAA GCA GTG CGA CAA CAT AGA .1082 
Ser His Glu Gly Tyr Gly Tyr Ser Asp Glu Ala Val Arg Gin His Arg 
810 815 820 825 

CAA GGG CAA CCT TGATTCACAC TGCCATAACC GCTTGCTGCC AAGGAAAACA 1134 
Gin Gly Gin Pro 



AA 1136 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 582 • base pairs 

(B) TYPE: Nucleotide 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
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(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 



(vi) IMMEDIATE SOURCE: 

(A) ORGANISM: Neisseria gonorrheae 

(B) STRAIN: MSll 

(ix) FEATURE: 

(A) NAME/BCEY: CDS 

(B) LOCATION: 136. , 447 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CGGCGCAAAC GGCGGACGCT GCTGTTAGCC CCGCTTGAAA CAAATGCCGT CTGAACGCCA 

CTTCAGACGG CATTTTTATA ATAAGGCGCT GTCCTAGATA ACTAGGGAAA TTCAAATTAA 

GTTAGAATTA TCCCT ATG AGA AAA AGC CGT CTA AGC CGG TAT AAA CAA AAt 
Met Arg Lys Ser Arg Leu Ser Arg Tyr Lys Gin Asn 
325 330 

AAA CTC ATT GAA CTG TTT GTC GCA GGC GTA ACT GCA AGA ACA GCA GCA 
Lys Leu lie Glu Leu Phe Val Ala Gly Val Thr Ala Arg Thr Ala Ala 
335 340 345 . 

GAG CCT GAC AGC ATT GTT TAT ACG GAT TGT TAT CGT CGC TAT GAT GTA 
Glu Pro Asp Ser lie Val Tyr Thr Asp Cys Tyr Arg Arg Tyr Asp Val 
350 355 360 

TTG GAT GCG GGC GAA TTT AGC CAT TTC CGT ATC AAT CAC AGC ACA CAT 
Leu Asp Ala Gly Glu Phe Ser His Phe Arg lie Asn His Ser Thr His 
365 370 375 380 

TTT GCC GAA CGA CAA AAC CAT ATT AAT GGA ATT GGG AAC TTT TGG AAC 
Phe Ala Glu Arg Gin Asn His lie Asn Gly lie Gly Asn Phe Trp Asn 
385 390 395 



CGG GCA AAA CGT CAT TTA CGC AAG TTT GAC GGC ATT CCC AAA GAG CAT 
Arg Ala Lys Arg His Leu Arg Lys Phe Asp Gly lie Pro Lys Glu His 
400 405 410 



TTT GAG CCG TAT TTA AAG GAG TGC GAA CGG CGT TTT TAACAACAGT 
Phe Glu Pro Tyr Leu Lys Glu Cys Glu Arg Arg Phe 
415 420 

GAGATAAAAG TTCTTGTTCC ATTTTAAAAC AATTAGTAAA ATCGAGTTTA TCCTAGTTGT 

CCAGGACGGC CCCTAATTTA TTTACAATTT TGATACAATT TGTTTTTCAT CAAAGGAGAA 

AATCT 

(2) INFORMATION FOR SEQ ID NO: 4 : . 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1744 base pairs 
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(B) TYPE: Nucleotide 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: Genomic DNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTISENSE: NO 

(vi) IMMEDIATE SOURCE: 

(A) ORGANISM: Neisseria gonorrheae 

(B) STRAIN: MSll 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 42. .1568 

(xi) SEQUENCE DESCRIPTION:- SEQ ID NO: 4: 
GATTCACACT GCCATAACCG CTTGCTGCCA AGGAAAACAA A ATG AAT TTG CCT 53 



Met Asn Leu Pro 
105 



ATT CAA AAA TTC ATG ATG CTG TTT GCA GCG GCA ATA TCG TTG CTG CAA 
lie Gin Lys Phe Met Met Leu Phe Ala Ala Ala lie Ser Leu Leu Gin 
110 115 120 



101 



ATC CCC ATT AGT CAT GCG AAC GGT TTG GAT GCC CGT TTG CGC GAT GAT 
lie Pro lie Ser His Ala Asn Gly Leu Asp Ala Arg Leu Arg Asp Asp 
125 130 135 . 140 
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• ATG CAG GCA AAA CAC TAC GAA CCG GGT GGC AAA TAG CAT CTG TTC GGT 
Met Gin Ala Lys His Tyr Glu Pro Gly Gly Lys Tyr His Leu Phe Gly 
145 150 155 



197 



AAT GOT CGC GGC AGT GTT AAA AAT CGG GTT TGC GCC GTC CAA ACA TTT 
Asn Ala Arg Gly Ser Val Lys Asn Arg Val Cys Ala Val Gin Thr Phe 
160 165 170 



245 



GAT GCA ACT GCG GTC GGC CCC ATA CTG CCT ATT ACA CAC GAA CGG ACA 
Asp Ala Thr Ala Val Gly Pro lie Leu Pro lie Thr His Glu Arg Thr 
175 180 185 
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GGG TTT GAA GGC ATT ATC GGT TAT GAA ACC CAT TTT TCA GGA CAC GGA 
Gly Phe Glu Gly lie He Gly Tyr Glu Thr His Phe Ser Gly His Gly 
190 195 200 



341 



CAC GAA GTA CAC AGT CCG TTC GAT AAT CAT GAT TCA AAA AGC ACT TCT 
His Glu Val His Ser Pro Phe Asp Asn His Asp Ser Lys Ser Thr Ser 
205 210 215 220 



389 



GAT TTC AGC GGC GGC GTA GAG GGC GGT TTT ACC GTT TAG CAA GTT CAT 
Asp Phe Ser Gly Gly Val Asp Gly Gly Phe Thr Val Tyr Gin Leu His 

225 230 235 



437 



CGG ACA GGG TCG GAA ATA CAT CCC GCA GAG GGA TAT GAC GGG CCT CAA 
Arg Thr Gly Ser Glu He His Pro Ala Asp Gly Tyr Asp Gly Pro Gin 



485 



35 



240 



245 



250 



GGC GGC GGT TAT CCG GAA CCA CAA GGG GCA AGG GAT ATA TAC AGC TAG 
Gly Gly Gly Tyr Pro Glu Pro Gin Gly Ala Arg Asp lie Tyr Ser Tyi: 
255 260 265 

CAT ATC AAA GGA ACT TCA ACC AAA ACA AAG ATA AAC ACT GTT CCG CAA 
His lie Lys Gly Thr Ser Thr Lys Thr Lys lie Asn Thr Val Pro Gin 
270 - 275 280 

GCC CCT TTT TCA G AC CGC TGG CTA AAA GAA AAT GCC GGT GCC GCT TCC 
Ala Pro Phe Ser Asp Arg Trp Leu Lys Glu Asn Ala Gly Ala Ala Ser 
285 290 295 300 

GGT TTT CTC AGC CGT GCG GAT GAA GCA GGA AAA CTG ATA TGG GAA AAC 
Gly Phe Leu Ser Arg Ala Asp Glu Ala Gly Lys Leu lie Trp Glu Asn 
305 310 315 

GAC CCC GAT AAA AAT TGG CGG GCT AAC CGT ATG GAT GAT ATT CGC GGC 
Asp Pro Asp Lys Asn Trp Arg Ala Asn Arg Met Asp Asp lie Arg Gly 
320 32.5 330 

ATC GTC CAA GGT GCG GTT AAT CCT TTT TTA . ACG GGT TTT CAG GGA TTG 
lie Val Gin Gly Ala Val Asn Pro Phe Leu Thr Gly Phe Gin Gly Leu 
335 340 345 v. 

GGA GTT GGG GCA ATT ACA GAC AGT GCG GTA AGC CCG GTA ACC TAT GCG 
Gly Val Gly Ala lie Thr Asp Ser Ala Val Ser Pro Val Thr Tyr Ala 
350 355 360 

GCA GCA CGG AAA ACT TTA CAG GGT ATT CAC AAT TTA GGA AAT TTA AGT 
Ala Ala Arg Lys Thr Leu Gin Gly lie His Asn Leu Gly Asn Leu Ser 
365 370 .375 380 

CCG GAA GCA CAA CTT GCC GCC GCG AGC CTA TTA CAG GAC AGT GCC TTT 
Pro Glu Ala Gin Leu Ala Ala Ala Ser Leu Leu Gin Asp Ser Ala Phe 
' 385 390 395 

GCG GTA AAA GAC GGC ATC AAT TCC GCC AGA CAA TGG GCT GAT GCC CAT 
Ala Val Lys Asp Gly lie Asn Ser Ala Arg Gin Trp Ala Asp Ala His 
400 405 410 

CCG AAT ATA ACA GCA ACA GCC CAA ACT GCC CTT GCC GTA GCA GAG GCT 
Pro Asn lie Thr Ala Thr Ala Gin Thr Ala Leu Ala Val Ala Glu Ala 
415 420 425 



GCA GGT ACG GTT TGG GGA GGT AAA AAA GTA GAA CTT AAC CCG ACC AAA 
Ala Gly Thr Val Trp Gly Gly Lys Lys Val Glu Leu Asn Pro Thr Lys 
430 435 440 
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TGG GAT TGG GTT AAA AAT ACC GGC TAT GAA AAA CCT GCT GCC CGA CCT 1109 
Trp Asp Trp Val Lys Asn Thr Gly Tyr Glu Lys Pro Ala Ala Arg Pro 
445 450 455 460 

ATG GAG ACT GTA GAG GGG GAA ATG GCC GGG AAA AAT AAG CCA CCG AAA 1157 
Met Gin Thr Val Asp Gly Glu Met Ala Gly Lys Asn Lys Pro Pro Lys 
465 470 475 



CCA AGT ACG GAG CAA CAC TCT ACA CAC TCT GAT AAC AAT ATG GGC TTA 1205 
Pro Ser Thr Gin Gin His Ser Thr His Ser Asp Asn Asn lie Gly Leu 
480 485 490 

CCT GCC CCA TAT GTT AAA CCT GAT ACA TCT ATT TCT CCG ACA GGA ACA 1253 
Pro Ala Pro Tyr Val Lys Pro Asp Thr Ser lie Ser Pro Thr Gly Thr 
495 500 505 

ATT CAA GAG CGC ATC AGA TGG ACA AAA TCC AAG TTT CCT ACT GAG AAA 1301 
lie Gin Asp Arg lie Arg Trp Thr Lys Ser Lys Phe Pro Thr Glu Lys 
510 515 520 



TCT TTA AAT GGA CAT TTC AAA GCT CAT GGA AAA GAA TTT GGC GAT ATA 134 9 

Ser Leu Asn Gly His Phe Lys Ala His Gly Lys Glu Phe Gly Asp lie 
525 530 535 540 

ACC ATT GAA GAG TAG CAA AAA ATG GGG TCT GAT TTG TTA TCA AAA GAG 1397 
Thr lie Glu Asp Tyr Gin Lys Met Ala Ser Asp Leu Leu Ser Lys Gin 
545 550 555 

ACA TGG GAG AAG ATA TTA GGT TAT GAG ACG GAA CAT AGA CGA GTG CGC 14 4 5 

Thr Ser Asp Lys lie Leu Gly Tyr Gin Thr Glu His Arg Arg Val Arg 
560 565 570 

TAT GAT ATC AAT AAC AAT ATC TAT GTT TTG GCC AAT CCA AAA ACA TTC 14 93 

Tyr Asp lie Asn Asn Asn lie Tyr Val Leu Ala Asn Pro Lys Thr Phe 
575 . 580 . 585 

AAA ATC AAA ACA ATG TTT AAA" CCA AAC TTA GGA AAG GAG TAT TAT GAT 1541 
Lys lie Lys Thr Met Phe Lys Pro Asn Leu Gly Lys Glu Tyr Tyr Asp 
590 595 600 

GGA GAA TTC AAA AAA GAG ATG GGA AAT TGACGGAGAA ATATGGCTAC 1588 
Gly Glu Phe Lys Lys Asp Met Gly Asn 
605 610 

ATTGTCCTGT TTGCGGAACT GAAGTTATGG ACTATGATAT CTGTGACGTT TGTCAGTGGC 1648 
AAAATACAGG AGAAACTAAT ATAGATGGTG GTCCTAATGA AATGACACTT GCGGAGGCGA 1708 



AAGAAGCTTA CGGAAAAGGC TTACCAATCA GATAAA 
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